4,314 research outputs found

    A model-based multithreshold method for subgroup identification

    Get PDF
    Thresholding variable plays a crucial role in subgroup identification for personalizedmedicine. Most existing partitioning methods split the sample basedon one predictor variable. In this paper, we consider setting the splitting rulefrom a combination of multivariate predictors, such as the latent factors, principlecomponents, and weighted sum of predictors. Such a subgrouping methodmay lead to more meaningful partitioning of the population than using a singlevariable. In addition, our method is based on a change point regression modeland thus yields straight forward model-based prediction results. After choosinga particular thresholding variable form, we apply a two-stage multiple changepoint detection method to determine the subgroups and estimate the regressionparameters. We show that our approach can produce two or more subgroupsfrom the multiple change points and identify the true grouping with high probability.In addition, our estimation results enjoy oracle properties. We design asimulation study to compare performances of our proposed and existing methodsand apply them to analyze data sets from a Scleroderma trial and a breastcancer study

    A Global SU(5) F-theory model with Wilson line breaking

    Full text link
    We engineer compact SU(5) Grand Unified Theories in F-theory in which GUT-breaking is achieved by a discrete Wilson line. Because the internal gauge field is flat, these models avoid the high scale threshold corrections associated with hypercharge flux. Along the way, we exemplify the `local-to-global' approach in F-theory model building and demonstrate how the Tate divisor formalism can be used to address several challenges of extending local models to global ones. These include in particular the construction of G-fluxes that extend non-inherited bundles and the engineering of U(1) symmetries. We go beyond chirality computations and determine the precise (charged) massless spectrum, finding exactly three families of quarks and leptons but excessive doublet and/or triplet pairs in the Higgs sector (depending on the example) and vector-like exotics descending from the adjoint of SU(5)_{GUT}. Understanding why vector-like pairs persist in the Higgs sector without an obvious symmetry to protect them may shed light on new solutions to the mu problem in F-theory GUTs.Comment: 95 pages (71 pages + 1 Appendix); v2 references added, minor correction

    Frequency Domain Estimation of Continuous Time Cointegrated Models with Mixed Frequency and Mixed Sample Data

    Get PDF
    Recent work by the author on mixed frequency data analysis has focused on the estimation of cointegrated systems in continuous time based on a fully specified dynamic system of equations, while the estimation of cointegrating vectors in a discrete time system has been approached using a semiparametric frequency domain estimator. We extend the latter approach to cover the continuous time case, establishing the asymptotic properties of the frequency domain estimator and explore, in a simulation study, the effects of misspecifying the continuous time dynamic model in discrete time compared to treating the dynamics non‐parametrically. An empirical illustration is also provided

    Heat transport in insulators from ab initio Green-Kubo theory

    Full text link
    The Green-Kubo theory of thermal transport has long be considered incompatible with modern simulation methods based on electronic-structure theory, because it is based on such concepts as energy density and current, which are ill-defined at the quantum-mechanical level. Besides, experience with classical simulations indicates that the estimate of heat-transport coefficients requires analysing molecular trajectories that are more than one order of magnitude longer than deemed feasible using ab initio molecular dynamics. In this paper we report on recent theoretical advances that are allowing one to overcome these two obstacles. First, a general gauge invariance principle has been established, stating that thermal conductivity is insensitive to many details of the microscopic expression for the energy density and current from which it is derived, thus permitting to establish a rigorous expression for the energy flux from Density-Functional Theory, from which the conductivity can be computed in practice. Second, a novel data analysis method based on the statistical theory of time series has been proposed, which allows one to considerably reduce the simulation time required to achieve a target accuracy on the computed conductivity. These concepts are illustrated in detail, starting from a pedagogical introduction to the Green-Kubo theory of linear response and transport, and demonstrated with a few applications done with both classical and quantum-mechanical simulation methods.Comment: 36 pages, 14 figure

    Goodness-of-Fit Tests for Symmetric Stable Distributions -- Empirical Characteristic Function Approach

    Full text link
    We consider goodness-of-fit tests of symmetric stable distributions based on weighted integrals of the squared distance between the empirical characteristic function of the standardized data and the characteristic function of the standard symmetric stable distribution with the characteristic exponent α\alpha estimated from the data. We treat α\alpha as an unknown parameter, but for theoretical simplicity we also consider the case that α\alpha is fixed. For estimation of parameters and the standardization of data we use maximum likelihood estimator (MLE) and an equivariant integrated squared error estimator (EISE) which minimizes the weighted integral. We derive the asymptotic covariance function of the characteristic function process with parameters estimated by MLE and EISE. For the case of MLE, the eigenvalues of the covariance function are numerically evaluated and asymptotic distribution of the test statistic is obtained using complex integration. Simulation studies show that the asymptotic distribution of the test statistics is very accurate. We also present a formula of the asymptotic covariance function of the characteristic function process with parameters estimated by an efficient estimator for general distributions

    Wavefunctions and the Point of E8 in F-theory

    Get PDF
    In F-theory GUTs interactions between fields are typically localised at points of enhanced symmetry in the internal dimensions implying that the coefficient of the associated operator can be studied using a local wavefunctions overlap calculation. Some F-theory SU(5) GUT theories may exhibit a maximum symmetry enhancement at a point to E8, and in this case all the operators of the theory can be associated to the same point. We take initial steps towards the study of operators in such theories. We calculate wavefunctions and their overlaps around a general point of enhancement and establish constraints on the local form of the fluxes. We then apply the general results to a simple model at a point of E8 enhancement and calculate some example operators such as Yukawa couplings and dimension-five couplings that can lead to proton decay.Comment: 46 page

    Adaptive Evolutionary Clustering

    Full text link
    In many practical applications of clustering, the objects to be clustered evolve over time, and a clustering result is desired at each time step. In such applications, evolutionary clustering typically outperforms traditional static clustering by producing clustering results that reflect long-term trends while being robust to short-term variations. Several evolutionary clustering algorithms have recently been proposed, often by adding a temporal smoothness penalty to the cost function of a static clustering method. In this paper, we introduce a different approach to evolutionary clustering by accurately tracking the time-varying proximities between objects followed by static clustering. We present an evolutionary clustering framework that adaptively estimates the optimal smoothing parameter using shrinkage estimation, a statistical approach that improves a naive estimate using additional information. The proposed framework can be used to extend a variety of static clustering algorithms, including hierarchical, k-means, and spectral clustering, into evolutionary clustering algorithms. Experiments on synthetic and real data sets indicate that the proposed framework outperforms static clustering and existing evolutionary clustering algorithms in many scenarios.Comment: To appear in Data Mining and Knowledge Discovery, MATLAB toolbox available at http://tbayes.eecs.umich.edu/xukevin/affec

    Structure in 6D and 4D N=1 supergravity theories from F-theory

    Get PDF
    We explore some aspects of 4D supergravity theories and F-theory vacua that are parallel to structures in the space of 6D theories. The spectrum and topological terms in 4D supergravity theories correspond to topological data of F-theory geometry, just as in six dimensions. In particular, topological axion-curvature squared couplings appear in 4D theories; these couplings are characterized by vectors in the dual to the lattice of axion shift symmetries associated with string charges. These terms are analogous to the Green-Schwarz terms of 6D supergravity theories, though in 4D the terms are not generally linked with anomalies. We outline the correspondence between F-theory topology and data of the corresponding 4D supergravity theories. The correspondence of geometry with structure in the low-energy action illuminates topological aspects of heterotic-F-theory duality in 4D as well as in 6D. The existence of an F-theory realization also places geometrical constraints on the 4D supergravity theory in the large-volume limit.Comment: 63 page

    Probabilistic Clustering of Time-Evolving Distance Data

    Full text link
    We present a novel probabilistic clustering model for objects that are represented via pairwise distances and observed at different time points. The proposed method utilizes the information given by adjacent time points to find the underlying cluster structure and obtain a smooth cluster evolution. This approach allows the number of objects and clusters to differ at every time point, and no identification on the identities of the objects is needed. Further, the model does not require the number of clusters being specified in advance -- they are instead determined automatically using a Dirichlet process prior. We validate our model on synthetic data showing that the proposed method is more accurate than state-of-the-art clustering methods. Finally, we use our dynamic clustering model to analyze and illustrate the evolution of brain cancer patients over time
    • 

    corecore